RESPONSE 



I. Status of the Claims 

Claim 6 has been cancelled without prejudice or disclaimer as being drawn to a non-elected
invention. Claim 7 has been amended. Claims 2-5 and 7-11 are therefore presently pending in the
case. In an attempt to comply with the revised 37 C.F.R. § 1.121, and for the convenience of the
Examiner, the status of the claims is set forth in Exhibit A, attached hereto.

II. Support for the Amended Specification and Claims

Claim 7 has been amended in response to an objection and to further clarify the claim. 
Amendment of Claim 7 finds support throughout the specification as originally filed with particular 
support being provided by the original Claim 1 and the Sequence Listing as originally filed. 

As the amendment of Claim 7 is fully supported by the specification and claims as originally
filed, it does not constitute new matter. Entry is therefore respectfully requested.

III. Rejection of Claims 2-5 and 7-11 Under 35 U.S.C. § 101

The Action first rejects claims 2-5 and 7-11 under 35 U.S.C. § 101, allegedly because the
claimed invention is not supported by either a specific and substantial utility or a
well-established utility.

The Action recognizes that the specification describes metalloproteinases (especially zinc
metalloproteases of the ADAMTS family) and indicates that the sequences of the present invention
encode a metalloprotease (in particular, ADAMTS14).

In support of this recognized assertion, Applicants respectfully submit that SEQ ID NO:20 of
the present invention is nearly identical to a sequence that is present in the leading scientific
repository for biological sequence data (GENBANK). This sequence, GENBANK Accession No.
Q8WXS8 (information provided as Exhibit B), has been annotated by third party scientists wholly
unaffiliated with Applicants as "ADAMTS-14 precursor (A disintegrin and metalloproteinase with
thrombospondin motifs 14) (ADAM-TS 14)". Thus, Applicants' assertion that the sequences of the
present invention represent a variant of Q8WXS8 is supported by the evidence provided in Exhibit
C, which contains an amino acid sequence comparison between SEQ ID NO:20 and the amino acid
sequence of Q8WXS8. From this comparison, it can be seen that SEQ ID NO:20 shares greater
than 95% homology with the amino acid sequence of Q8WXS8 (98.611% identity over a 1224 amino
acid overlap).
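
For illustration only, the percent identity figure reported by a sequence comparison can be reproduced from an alignment with a few lines of code. The following is a minimal sketch, assuming the two sequences are supplied as equal-length aligned strings with "-" marking gaps; the function name and the toy sequences are hypothetical and are not taken from Exhibit C.

```python
def percent_identity(query: str, subject: str) -> float:
    """Percent identity over an alignment given as two equal-length, gapped strings."""
    if len(query) != len(subject):
        raise ValueError("aligned sequences must have equal length")
    if not query:
        raise ValueError("alignment is empty")
    # A column counts as identical only when both residues match and neither is a gap.
    matches = sum(1 for q, s in zip(query, subject) if q == s and q != "-")
    return 100.0 * matches / len(query)


# Hypothetical 10-column toy alignment with one mismatch and one gap column.
toy_query = "MAPLRXLLSY"
toy_subject = "MAPLRALL-Y"
print(round(percent_identity(toy_query, toy_subject), 1))  # 8 of 10 columns match
```

As a consistency check on the arithmetic, 1207 identical columns out of 1224 would give 98.611% when rounded to three decimal places; the count of 1207 is an inference from the reported percentage and overlap length, not a figure stated in the exhibit.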

Tissues which express the sequences of the present invention were described in the
specification (page 3, line 5) as being human spinal cord, lymph node, bone marrow, trachea,
mammary gland, skeletal muscle, pericardium, adipose, esophagus, bladder, fetal kidney, and fetal
lung cells. The activity of the protein encoded by the sequences of the present invention is
described in several publications (Abstracts provided as Exhibit D). One is entitled
"Characterization of ADAMTS14, a novel member of the ADAMTS metalloproteinase family" by Bolz
et al., 2001 (Biochim Biophys Acta. 1522:221-5, 2001). A second is entitled "Cloning and
characterization of ADAMTS-14, a novel ADAMTS displaying high homology with ADAMTS-2 and
ADAMTS-3" by Colige et al., 2002 (J Biol Chem. 277:5756-66, 2002, Epub 2001 Dec 07). The third
is entitled "Cloning, expression analysis, and structural characterization of seven novel human
ADAMTSs, a family of metalloproteinases with disintegrin and thrombospondin-1 domains" by Cal
et al., 2002 (Gene 283:49-62).

Applicants respectfully submit that these publications constitute evidence clearly
demonstrating that the proteins of the present invention have a function and a utility that are
both accepted by those skilled in the art. The legal test for utility simply involves an
assessment of whether those skilled in the art would find any of the utilities described for the
invention to be credible or believable. Clearly, those of skill in the art would recognize that
molecules that share such amino acid sequence homology would share protein structure and would
thus also share the same function. This constitutes evidence that clearly supports the
specification's assertion that SEQ ID NOS: 19 and 20 encode a known protein (the metalloprotease
ADAMTS14).

Applicants submit that the presently claimed molecules have been shown to encode, as asserted
in the specification as filed, ADAMTS-14 precursor (A disintegrin and metalloproteinase with
thrombospondin motifs 14) (ADAM-TS 14), which is known to the art. Thus, the present situation
directly tracks Example 10 of the Revised Interim Utility Guidelines Training Materials (pages
53-55), which establishes that a rejection under 35 U.S.C. § 101 as allegedly lacking a patentable
utility, and under 35 U.S.C. § 112, first paragraph, as allegedly unusable by the skilled artisan
due to the alleged lack of patentable utility, is not proper when a full length sequence (such as
the presently claimed sequence) has a similarity score greater than 95% to a protein having a
known function (such as the 98.611% identity, shown in Exhibit C, between the presently claimed
sequences and those of the cited protein, Q8WXS8).

Further evidence of the utility of the presently claimed polynucleotide, although only one
utility is needed to meet the requirements of 35 U.S.C. § 101 (Raytheon v. Roper, 220 USPQ 592
(Fed. Cir. 1983); In re Gottlieb, 140 USPQ 665 (CCPA 1964); In re Malachowski, 189 USPQ 432
(CCPA 1976); Hoffman v. Klaus, 9 USPQ2d 1657 (Bd. Pat. App. & Inter. 1988)), is the specific
utility the present nucleotide sequence has in determining the genomic structure of the
corresponding human chromosome, for example in mapping the protein encoding regions as described
in the specification. Evidence supporting Applicants' assertions of the specific utility of the
sequences of the present invention in localizing the specific region of the human chromosome and
in identifying functionally active intron/exon splice junctions is provided as Exhibit E. This
exhibit is the result of overlaying the sequence of SEQ ID NO: 19 of the present invention on the
identified human genomic sequence. By doing this, one is able to identify the portions of the
genome that encode the present invention. Where these regions of the genome are non-contiguous,
each is indicative of an individual exon. The results of such an analysis indicate that the
sequence of the present invention is encoded by at least 24 exons spread non-contiguously along
a region of human chromosome 10, which are contained within four BAC clones: AL335344.20,
AC007484.2, AC069538.10 and AC007484.2. Clearly, one would not simply be able to identify the 24
or more protein encoding exons that make up the sequence of the present invention from within
the large genomic sequence. Nor would one be able to map the protein encoding regions identified
specifically by the sequences of the present invention without knowing exactly what those
specific sequences were. Additionally, it should be noted that the human ADAMTS14 gene also maps
to the same region of human chromosome 10 (at approximately 10q2). This further supports
Applicants' position that the sequences of the present invention encode a variant of the human
ADAMTS14.
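
The overlay analysis described above can be sketched in code for illustration. The following is a minimal sketch, assuming the cDNA-to-genome alignment is available as a list of ungapped blocks on the forward strand, each given as (cDNA start, cDNA end, genome start, genome end) with 1-based inclusive coordinates. The function names and the toy coordinates are hypothetical and do not reproduce the data of Exhibit E.

```python
from typing import List, Tuple

# (cdna_start, cdna_end, genome_start, genome_end), 1-based inclusive, forward strand (assumed)
Block = Tuple[int, int, int, int]


def merge_into_exons(blocks: List[Block]) -> List[Tuple[int, int]]:
    """Merge blocks that abut in both cDNA and genome; each merged run is one candidate exon."""
    exons: List[List[int]] = []
    prev_cdna_end = None
    for c_start, c_end, g_start, g_end in sorted(blocks):
        contiguous = (
            exons
            and c_start == prev_cdna_end + 1
            and g_start == exons[-1][1] + 1
        )
        if contiguous:
            exons[-1][1] = g_end  # extend the current exon
        else:
            exons.append([g_start, g_end])  # a genomic jump starts a new exon
        prev_cdna_end = c_end
    return [(start, end) for start, end in exons]


def intron_lengths(exons: List[Tuple[int, int]]) -> List[int]:
    """Length of the genomic gap between each pair of consecutive exons."""
    return [nxt[0] - cur[1] - 1 for cur, nxt in zip(exons, exons[1:])]


# Hypothetical toy alignment: the first two blocks abut in the genome, the rest do not.
toy_blocks = [
    (1, 100, 5000, 5099),
    (101, 150, 5100, 5149),
    (151, 300, 9000, 9149),
    (301, 420, 15000, 15119),
]
toy_exons = merge_into_exons(toy_blocks)
print(len(toy_exons), intron_lengths(toy_exons))
```

In this toy case the four aligned blocks resolve into three candidate exons separated by two intervening genomic gaps, which is the sense in which non-contiguous genomic matches to a contiguous cDNA indicate individual exons.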

Therefore, clearly, the present polynucleotide provides exquisite specificity in localizing the
specific region of the human chromosome containing the gene encoding the given polynucleotide, a
utility shared by virtually no other nucleic acid sequence. In fact, it is this specificity that
makes this particular sequence so useful. Early gene mapping techniques relied on methods such as
Giemsa staining to identify regions of chromosomes. However, such techniques produced genetic
maps with a resolution of only 5 to 10 megabases, far too low to be of much help in identifying
specific genes involved in disease. The skilled artisan readily appreciates the significant
benefit afforded by markers that map a specific locus of the human genome, such as the present
nucleic acid sequence.

Only a minor percentage of the genome actually encodes exons, which in turn encode amino 
acid sequences. The presently claimed polynucleotide sequence provides biologically validated 
empirical data (e.g., showing which sequences are transcribed, spliced, and polyadenylated) that 
specifically defines that portion of the corresponding genomic locus that actually encodes exon 
sequence. Equally significant is that the claimed polynucleotide sequence defines how the encoded 
exons are actually spliced together to produce an active transcript (i.e., the described sequences are
useful for functionally defining exon splice-junctions). The Applicants respectfully submit that the 
practical scientific value of expressed, spliced, and polyadenylated mRNA sequences is readily 
apparent to those skilled in the relevant biological and biochemical arts. The presently claimed 
polynucleotide sequence defines a biologically validated sequence that provides a unique and specific 
resource for mapping the genome. 

An additional utility includes the use of the presently claimed polynucleotides on DNA chips.
Further, the Action seems to be requiring Applicants to identify the biological role of the
nucleic acid or the function of the protein encoded by the presently claimed polynucleotides
before the present sequences can be used in gene chip applications that meet the requirements of
§ 101. Applicants respectfully point out that knowledge of the exact function or role of the
presently claimed sequence is not required to track expression patterns using a DNA chip. Given
the widespread utility of such "gene chip" methods using public domain gene sequence information,
there can be little doubt that the use of the presently described novel sequences would have
great utility in such DNA chip applications. The claimed sequence provides a specific marker of
the human genome (see evidence above), and such specific markers are targets for discovering
drugs that are associated with human disease. Thus, those skilled in the art would instantly
recognize that the present nucleotide sequence would be an ideal, novel candidate for assessing
gene expression using, for example, DNA chips, as the specification details. Such "DNA chips"
clearly have utility, as evidenced by hundreds of issued U.S. Patents, as exemplified by U.S.
Patent Nos. 5,445,934, 5,556,752, 5,744,305, as well as more recently issued U.S. Patent Nos.
5,837,832, 6,156,501 and 6,261,776. Accordingly, the present sequence has a specific utility in
such DNA chip applications. Clearly, compositions that enhance the utility of such DNA chips,
such as the presently claimed nucleotide sequence, must also be useful.

Additionally, only a small percentage of the genome (2-4%) actually encodes exons, which in
turn encode amino acid sequences. Thus, not all human genomic DNA sequences are useful in such
gene chip applications. Thus, the present claims clearly meet the requirements of 35 U.S.C.
§ 101. It has been clearly established that a statement of utility in a specification must be
accepted absent reasons why one skilled in the art would have reason to doubt the objective truth
of such statement. In re Langer, 503 F.2d 1380, 1391, 183 USPQ 288, 297 (CCPA 1974); In re
Marzocchi, 439 F.2d 220, 224, 169 USPQ 367, 370 (CCPA 1971).

Evidence of the "real world" substantial utility of the present invention is further provided by
the fact that there is an entire industry established based on the use of gene sequences or
fragments thereof in a gene chip format. Perhaps the most notable gene chip company is
Affymetrix. However, there are many companies which have, at one time or another, concentrated on
the use of gene sequences or fragments, in gene chip and non-gene chip formats, for example: Gene
Logic, ABI-Perkin-Elmer, HySeq and Incyte. In addition, one such company, Rosetta Inpharmatics,
was viewed to have such "real world" value that it was acquired by a large pharmaceutical
company, Merck & Co., for a substantial sum of money (the net equity value of the transaction was
$620 million). The "real world" substantial industrial utility of gene sequences or fragments
would, therefore, appear to be widespread and well established. Clearly, persons of skill in the
art, as well as venture capitalists and investors, readily recognize the utility, both scientific
and commercial, of genomic data in general, and specifically human genomic data. Billions of
dollars have been invested in the human genome project, resulting in useful genomic data (see,
e.g., Venter et al., 2001, Science 291:1304). The results have been a stunning success, as the
utility of human genomic data has been widely recognized as a great gift to humanity (see, e.g.,
Jasny and Kennedy, 2001, Science 291:1153). Clearly, the usefulness of human genomic data, such
as the presently claimed nucleic acid molecules, is substantial and credible (worthy of billions
of dollars and the creation of numerous companies focused on such information) and
well-established (the utility of human genomic information has been clearly understood for many
years). The sequences of the present invention have particularly specific utility in DNA gene
chip based analysis, as they have been identified to contain several coding region nucleotide
polymorphisms (see above), thus increasing their utility in DNA gene chip based analysis.

Finally, the Examiner is requested to consider the issue of due process. Applicants'
understanding is that issued United States patents retain a legal presumption of validity, which
in this case indicates that the inventions claimed in the cited patents are legally presumed to
be in full compliance with the provisions of 35 U.S.C. sections 101, 102, 103, and 112.
Applicants respectfully submit that, absent a change in the law as enacted by Congress and signed
by the President, it is improper for the Examiner to hold Applicants' invention to a different
legal standard of patentability. Given the rapid pace of development in the biotechnology arts,
it is difficult for the Applicants to understand how an invention fully disclosed and free of
prior art at the time the present application was filed could somehow retain less utility and be
less enabled than the inventions in the cited issued U.S. patents (which were filed during a time
when the level of skill in the art was clearly lower). Simply put, Applicants' invention is more
enabled and retains at least as much utility as the inventions described in the claims of the
U.S. patents of record. Any argument to the contrary is at best arbitrary and at worst
capricious. Absent authority provided by an act of Congress or Executive order, arbitrary or
capricious conduct by an administrative office of the U.S. government has historically proven to
conflict with the provisions of the U.S. Constitution. The Patent Office does not have the
authority to rewrite U.S. law. However, the Patent Office does have a Constitutional obligation
to administer U.S. law in an unbiased and procedurally consistent manner. That is what the
Applicants are respectfully requesting the Examiner to consider in the present matter. As the
issued U.S. Patents cited above are presumed to meet all of the requirements for patentability,
including 35 U.S.C. §§ 101 and 112, first paragraph, Applicants respectfully submit that the
presently claimed polynucleotide must also meet the requirements of 35 U.S.C. § 101.

Thus, in summary, Applicants submit that the presently claimed molecules have been shown to
encode, as asserted in the specification as filed, ADAMTS14 isoforms, whose biological function
is known to the art. Thus, the present situation directly tracks Example 10 of the Revised
Interim Utility Guidelines Training Materials (pages 53-55), which establishes that a rejection
under 35 U.S.C. § 101 as allegedly lacking a patentable utility, and under 35 U.S.C. § 112, first
paragraph, as allegedly unusable by the skilled artisan due to the alleged lack of patentable
utility, is not proper when a full length sequence (such as the presently claimed sequence) has a
similarity score greater than 95% to a protein having a known function. Furthermore, this
response has described a series of additional substantial, specific, credible and
well-established utilities for the present invention. Therefore, Applicants submit that, as the
presently claimed sequence molecules have been shown to have a substantial, specific, credible
and well-established utility, the rejection of the claims under 35 U.S.C. § 101 has been
overcome. Thus, Applicants respectfully request that the rejection be withdrawn.

IV. Rejection of Claims 2-5 and 7-11 Under 35 U.S.C. § 112, First Paragraph

The Action next rejects claims 2-5 and 7-11 under 35 U.S.C. § 112, first paragraph, allegedly
because, since the claimed invention is not supported by either a specific asserted utility or a
well-established utility for the reasons set forth above, one skilled in the art would not know
how to use the claimed invention.

Applicants respectfully submit that claims 2-5 and 7-11 have been shown to have "a specific,
substantial, and credible utility", as detailed in the section above. Therefore, one skilled in
the art would clearly know how to use the claimed invention. Applicants submit that, as the
presently claimed sequence molecules have been shown to have a substantial, specific, credible
and well-established utility, the rejection of the claims under 35 U.S.C. § 112, first paragraph,
has been overcome. Thus, Applicants respectfully request that the rejection be withdrawn.

V. Rejection of Claim 7 Under 35 U.S.C. § 112, Second Paragraph

The Action next rejects claim 7 under 35 U.S.C. § 112, second paragraph, as allegedly being
indefinite for failing to particularly point out and distinctly claim the subject matter which
applicant regards as the invention. While Applicants in no way agree, Claim 7 has been amended to
replace the term "drawn" with the term "selected".

Therefore, Applicants submit that amended Claim 7 is clearly definite and particularly points
out and distinctly claims the subject matter which applicant regards as the invention, and thus
the rejection of the claim under 35 U.S.C. § 112, second paragraph, has been overcome.
Applicants, therefore, respectfully request that the rejection be withdrawn.



VI. Conclusion 

The present document is a full and complete response to the Action. In conclusion, Applicants 
submit that, in light of the foregoing remarks, the present case is in condition for allowance, and such 
favorable action is respectfully requested. Should Examiner Moore have any questions or comments, 
or believe that certain amendments of the claims might serve to improve their clarity, a telephone call 
to the undersigned Applicants' representative is earnestly solicited. 



Respectfully submitted, 



August L2003 




Lance K. Ishimoto Reg. No. 41,866

Agent for Applicants 



Date 



LEXICON GENETICS INCORPORATED 
(281) 863-3333 






24231 



PATENT TRADEMARK OFFICE 
Exhibit A
Status of Claims in
U.S. Patent Application Ser. No. 09/938,330



1. (cancelled) 

2. (original) An isolated nucleic acid molecule comprising a nucleotide sequence that 

(a) encodes the amino acid sequence shown in SEQ ID NO:20; and

(b) hybridizes under stringent conditions to the nucleotide sequence of SEQ ID 
NO: 19 or the complement thereof. 

3. (original) An isolated nucleic acid molecule according to Claim 2 wherein said 
nucleotide sequence is present in cDNA. 

4. (original) An isolated nucleic acid molecule encoding the amino acid sequence 
presented in SEQ ID NO:20. 

5. (original) An isolated nucleic acid molecule encoding the amino acid sequence 
presented in SEQ ID NO:22. 

6. (cancelled) 

7. (currently amended) An isolated nucleic acid molecule comprising a nucleotide 
sequence encoding an amino acid sequence drawn selected from the group consisting of SEQ ID 
NOS:20 and 22. 

8. (previously presented) An expression vector comprising a nucleic acid sequence of
Claim 4.

9. (previously presented) A cell comprising the expression vector of Claim 8.

10. (previously presented) An expression vector comprising a nucleic acid sequence
of Claim 5.

11. (previously presented) A cell comprising the expression vector of Claim 10.



LOCUS       Q8WXS8                  1223 aa            linear   PRI 15-SEP-2003
DEFINITION  ADAMTS-14 precursor (A disintegrin and metalloproteinase with
            thrombospondin motifs 14) (ADAM-TS 14) (ADAM-TS14).
ACCESSION   Q8WXS8
VERSION     Q8WXS8  GI:29337086
DBSOURCE    swissprot: locus AT14_HUMAN, accession Q8WXS8;
            class: standard.
            extra accessions: Q8TE55, Q8TEY8, created: Feb 28, 2003.
            sequence updated: Feb 28, 2003.
            annotation updated: Sep 15, 2003.
            xrefs: gi: 17483853, gi: 17483854, gi: 19171187, gi: 19171188, gi:
            18874445, gi: 18874446
            xrefs (non-sequence databases): MEROPSM12.024, GenewHGNC:14899, MIM
            607506, InterProIPR001762, InterProIPR001818, InterProIPR002870,
            InterProIPR001590, InterProIPR000884, InterProIPR008085,
            InterProIPR006025, PfamPF01562, PfamPF01421, PfamPF00090,
            PRINTSPR01705, SMARTSM00209, PROSITEPS50215, PROSITEPS00546,
            PROSITEPS00427, PROSITEPS50214, PROSITEPS50092, PROSITEPS00142
KEYWORDS    Hydrolase; Metalloprotease; Zinc; Signal; Glycoprotein; Zymogen;
            Collagen degradation; Repeat; Extracellular matrix; Alternative
            splicing; Alternative promoter usage.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (residues 1 to 1223)
  AUTHORS   Bolz,H., Ramirez,A., von Brederlow,B. and Kubisch,C.
  TITLE     Characterization of ADAMTS14, a novel member of the ADAMTS
            metalloproteinase family
  JOURNAL   Biochim. Biophys. Acta 1522 (3), 221-225 (2001)
  MEDLINE   21638061
  REMARK    SEQUENCE FROM N.A. (ISOFORM A)
REFERENCE   2  (residues 1 to 1223)
  AUTHORS   Cal,S., Obaya,A.J., Llamazares,M., Garabaya,C., Quesada,V. and
            Lopez-Otin,C.
  TITLE     Cloning, expression analysis, and structural characterization of
            seven novel human ADAMTSs, a family of metalloproteinases with
            disintegrin and thrombospondin-1 domains
  JOURNAL   Gene 283 (1-2), 49-62 (2002)
  MEDLINE   21856482
  REMARK    SEQUENCE FROM N.A. (ISOFORM A).
            TISSUE=Fetal lung
REFERENCE   3  (residues 1 to 1223)
  AUTHORS   Colige,A., Vandenberghe,I., Thiry,M., Lambert,C.A., Van Beeumen,J.,
            Li,S.W., Prockop,D.J., Lapiere,C.M. and Nusgens,B.V.
  TITLE     Cloning and characterization of ADAMTS-14, a novel ADAMTS
            displaying high homology with ADAMTS-2 and ADAMTS-3
  JOURNAL   J. Biol. Chem. 277 (8), 5756-5766 (2002)
  MEDLINE   21839041
  REMARK    SEQUENCE OF 29-1223 FROM N.A. (ISOFORMS B; C AND D), AND
            ALTERNATIVE PROMOTER USAGE.
COMMENT     This SWISS-PROT entry is copyright. It is produced through a
            collaboration between the Swiss Institute of Bioinformatics and
            the EMBL outstation - the European Bioinformatics Institute.
            The original entry is available from http://www.expasy.ch/sprot
            and http://www.ebi.ac.uk/sprot

            [FUNCTION] Has a aminoprocollagen type I activity processing
            activity in the absence of ADAMTS2. Seems to be synthesized as a
            latent enzyme that requires activation to display aminoprocollagen
            peptidase activity.

            [SUBCELLULAR LOCATION] Secreted. Associated with the extracellular
            matrix (By similarity).

            [ALTERNATIVE PRODUCTS] Event=Alternative promoter; Comment=2
            isoforms, A (shown here) and B, are produced by use of alternative
            promoters; Event=Alternative splicing; Named isoforms=4; Name=A;
            IsoId=Q8WXS8-1; Sequence=Displayed; Name=B; IsoId=Q8WXS8-2;
            Sequence=VSP_006958; Name=C; IsoId=Q8WXS8-3; Sequence=VSP_006958,
            VSP_005501; Note=Produced by alternative splicing of isoform B;
            Name=D; IsoId=Q8WXS8-4; Sequence=VSP_005501; Note=Produced by
            alternative splicing of isoform A.

            [TISSUE SPECIFICITY] Expressed in retina and at low levels in
            brain, lung and placenta. High expression in fetal tissues.
            [DOMAIN] The spacer domain and the TSP type-1 domains are important
            for a tight interaction with the extracellular matrix (By
            similarity).

            [PTM] The precursor is cleaved by a furin endopeptidase (By
            similarity).

            [SIMILARITY] Belongs to peptidase family M12B.
            [SIMILARITY] Contains 1 disintegrin-like domain.
            [SIMILARITY] Contains 1 PLAC domain.
            [SIMILARITY] Contains 4 TSP type-1 domains.
FEATURES             Location/Qualifiers
     source          1..1223
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
     gene            1..1223
                     /gene="ADAMTS14"
     Protein         1..1223
                     /gene="ADAMTS14"
                     /product="ADAMTS-14 precursor"
                     /EC_number="3.4.24.-"
     Region          1..68
                     /gene="ADAMTS14"
                     /region_name="Splicing variant"
                     /note="Missing (in isoform B and isoform C).
                     /FTId=VSP_006958."
     Region          1..22
                     /gene="ADAMTS14"
                     /region_name="Signal"
                     /note="POTENTIAL."
     Region          23..252
                     /gene="ADAMTS14"
                     /region_name="Propeptide"
                     /note="BY SIMILARITY."
     Site            109
                     /gene="ADAMTS14"
                     /site_type="glycosylation"
                     /note="N-LINKED (GLCNAC...) (POTENTIAL)."
     Region          253..1223
                     /gene="ADAMTS14"
                     /region_name="Mature chain"
                     /note="ADAMTS-14."
     Region          253..460
                     /gene="ADAMTS14"
                     /region_name="Domain"
                     /note="METALLOPROTEASE."
     Region          368
                     /gene="ADAMTS14"
                     /region_name="Splicing variant"
                     /note="G -> GMQG (in isoform C and isoform D).
                     /FTId=VSP_005501."
     Site            398
                     /gene="ADAMTS14"
                     /site_type="metal-binding"
                     /note="ZINC (CATALYTIC) (BY SIMILARITY)."
     Site            399
                     /gene="ADAMTS14"
                     /site_type="active"
                     /note="BY SIMILARITY."
     Site            402
                     /gene="ADAMTS14"
                     /site_type="metal-binding"
                     /note="ZINC (CATALYTIC) (BY SIMILARITY)."
     Site            408
                     /gene="ADAMTS14"
                     /site_type="metal-binding"
                     /note="ZINC (CATALYTIC) (BY SIMILARITY)."
     Region          461..551
                     /gene="ADAMTS14"
                     /region_name="Domain"
                     /note="DISINTEGRIN-LIKE."
     Site            475
                     /gene="ADAMTS14"
                     /site_type="glycosylation"
                     /note="N-LINKED (GLCNAC...) (POTENTIAL)."
     Region          552..607
                     /gene="ADAMTS14"
                     /region_name="Domain"
                     /note="TSP TYPE-1 1."
     Region          608..729
                     /gene="ADAMTS14"
                     /region_name="Domain"
                     /note="CYS-RICH."
     Region          730..846
                     /gene="ADAMTS14"
                     /region_name="Domain"
                     /note="SPACER."
     Region          847..907
                     /gene="ADAMTS14"
                     /region_name="Domain"
                     /note="TSP TYPE-1 2."
     Region          868
                     /gene="ADAMTS14"
                     /region_name="Conflict"
                     /note="Q -> R (IN REF. 2)."
     Region          875..878
                     /gene="ADAMTS14"
                     /region_name="Domain"
                     /note="POLY-ARG."
     Region          884
                     /gene="ADAMTS14"
                     /region_name="Conflict"
                     /note="Q -> H (IN REF. 2)."
     Region          901
                     /gene="ADAMTS14"
                     /region_name="Conflict"
                     /note="C -> S (IN REF. 2)."
     Region          908..967
                     /gene="ADAMTS14"
                     /region_name="Domain"
                     /note="TSP TYPE-1 3."
     Region          923
                     /gene="ADAMTS14"
                     /region_name="Conflict"
                     /note="C -> Y (IN REF. 2)."
     Site            941
                     /gene="ADAMTS14"
                     /site_type="glycosylation"
                     /note="N-LINKED (GLCNAC...) (POTENTIAL)."
     Region          968..1022
                     /gene="ADAMTS14"
                     /region_name="Domain"
                     /note="TSP TYPE-1 4."
     Region          1024
                     /gene="ADAMTS14"
                     /region_name="Conflict"
                     /note="N -> S (IN REF. 2)."
     Site            1027
                     /gene="ADAMTS14"
                     /site_type="glycosylation"
                     /note="N-LINKED (GLCNAC...) (POTENTIAL)."
     Region          1063..1093
                     /gene="ADAMTS14"
                     /region_name="Domain"
                     /note="PLAC."
     Region          1100..1221
                     /gene="ADAMTS14"
                     /region_name="Domain"
                     /note="PRO-RICH."



ORIGIN
        1 maplrallsy llplhcalca aagsrtpelh lsgklsdygv tvpcstdfrg rflshwsgp
       61 aaasagsmvv dtpptlprhs shlrvarspl hpggtlwpgr vgrhslyfnv tvfgkelhlr
      121 lrpnrrlwp gssvewqedf relfrqplrq ecvytggvtg mpgaavaisn cdglaglirt
      181 dstdffiepl ergqqekeas grthwyrre avqqewaepd gdlhneafgl gdlpnllglv
      241 gdqlgdterk rrhakpgsys ievllvvdds vvrfhgkehv qnyvltlmni vdeiyhdesl
      301 gvhinialvr limvgyrqsl sliergnpsr sleqvcrwah sqqrqdpsha ehhdhwflt
      361 rqdfgpsgya pvtgmchplr scalnhedgf ssafviahet ghvlgmehdg qgngcadets
      421 lgsvmaplvq aafhrfhwsr csklelsryl psydcllddp fdpawpqppe lpginysmde
      481 qcrfdfgsgy qtclafrtfe pckqlwcshp dnpyfcktkk gppldgteca pgkwcfkghc
      541 iwkspeqtyg qdggwsswtk fgscsrscgg gvrsrsrscn npspayggrp clgpmfeyqv
      601 cnseecpgty edfraqqcak rnsyyvhqna khswvpyepd ddaqkcelic qsadtgdwf
      661 mnqwhdgtr csyrdpysvc argecvpvgc dkevgsmkad dkcgvcggdn shcrtvkgtl
      721 gkaskqagal klvqipagar hiqiealeks phriwknqy tgsfilnpkg keatsrtfta
      781 mglewedave dakeslktsg plpeaiaila lppteggprs slaykyvihe dllpligsnn
      841 vlleemdtye walkswapcs kacgggiqft kygcrrrrdh hmvqrhlcdh kkrpkpirrr
      901 cnqhpcsqpv wvteewgacs rscgklgvqt rgiqcllpls ngthkvmpak acagdrpear
      961 rpclrvpcpa qwrlgawsqc satcgegiqq rqwcrtnan slghcegdrp dtvqycslpa
     1021 cggnhqnstv radvwelgtp egqwvpqsep lhpinkisst epctgdrsvf cqmevldryc
     1081 sipgyhrlcc vscikkasgp npgpdpgpts lppfstpgsp lpgpqdpada aeppgkptgs
     1141 edhqhgratq lpgaldtssp gtqhpfapet pipgaswsis pttpgglpwg wtqtptpvpe
     1201 dkgqpgedlr hpgtslpaas pvt






Jul 17 2003 11:56:53 
FASTA searches a protein or DNA sequence data bank
 version 3.3t05 March 30, 2000
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

/tmp/fastaCAARWaOhY: 1252 aa
 >Lex 221 SEQ ID NO: 20
 vs /tmp/fastaDAASWaOhY library
searching /tmp/fastaDAASWaOhY library

 1223 residues in 1 sequences
FASTA (3.34 January 2000) function [optimized, BL50 matrix (15:-5)] ktup: 2
 join: 40, opt: 28, gap-pen: -12/ -2, width: 16
 Scan time: 0.067
The best scores are:                                      opt
gi|29337086|sp|Q8WXS8|AT14_HUMAN ADAMTS-14 precur  (1223) 8686

>>gi|29337086|sp|Q8WXS8|AT14_HUMAN ADAMTS-14 precursor ( (1223 aa)
 initn: 8683 init1: 7506 opt: 8686
Smith-Waterman score: 8686; 98.611% identity in 1224 aa overlap (1-1224:1-1221)



Lex 



10 20 30 40 50 60 

MAPLRALLSYLLPLHCALCXAAGSRTPELHLSGKLSDYGVWPCSTDFRGRFLSHWSGP 



gi | 2 93 MAPLRALLSYLLPLHCALCAAAGSRTPELHLSGKLSDYGVTVPCSTDFRGRFLSHWSGP 
10 20 30 40 50 60 



Lex 



70 80 90 100 110 120 

AAASAGSMWDTPPTLPRHSSHLRVARSPLHPGGTLWPGRVGRHSLYFNVTVFGKELHLR 



gi | 2 93 AAASAGSMVVDTPPTLPRHSSHLRVARSPLHPGGTLWPGRVGRHSLYFWTVFGKELHLR 
70 80 90 100 110 120 



Lex 



130 140 150 160 170 180 

LRPNRRLWPGSSVEWQEDFRELFRQPLRQECVYTGGVTGMPGAAVAISNCDGLAGLIRT 



gi | 293 LRPNRRLWPGSSVEWQEDFRELFRQPLRQECVYTGGVTGMPGAAVAISNCDGLAGLIRT 
130 140 150 160 170 180 



190 200 210 220 230 240

Lex DSTDFFIEPLERGQQEKEASGRTHWYRREAVQQEWAEPDGDLHNEAFGLGDLPNLLGLV

gi | 293 DSTDFFIEPLERGQQEKEASGRTHWYRREAVQQEWAEPDGDLHNEAFGLGDLPNLLGLV
190 200 210 220 230 240



250 260 270 280 290 300

Lex GDQLGDTERKRRHAKPGSYSIEVLLVVDDSVVRFHGKEHVQNYVLTLMNIVDEIYHDESL

gi | 293 GDQLGDTERKRRHAKPGSYSIEVLLVVDDSVVRFHGKEHVQNYVLTLMNIVDEIYHDESL
250 260 270 280 290 300



310 320 330 340 350 360 

Lex GVHINIALVRLIMVGYRQSLSLIERGNPSRSLEQVCRWAHSQQRQDPSHAEHHDHWFLT 



gi | 2 93 GVHINIALVRLIMVGYRQSLSLIERGNPSRSLEQVCRWAHSQQRQDPSHAEHHDHWFLT 
310 320 330 340 350 360 



370 380 390 400 410 420 

Lex RQDFGPSGYAPVTGMCHPLRSCALNHEDGFSSAFVIAHETGHVLGMEHDGQGNGCADETS

gi | 293 RQDFGPSGYAPVTGMCHPLRSCALNHEDGFSSAFVIAHETGHVLGMEHDGQGNGCADETS
370 380 390 400 410 420 

430 440 450 460 470 480 

Lex LGSVMAPLVQAAFHRFHWSRCSKLELSRYLPSYDCLLDDPFDPAWPQPPELPGINYSMDE 



gi | 293 LGSVMAPLVQAAFHRFHWSRCSKLELSRYLPSYDCLLDDPFDPAWPQPPELPGINYSMDE 
430 440 450 460 470 480 

490 500 510 520 530 540 

Lex QCRFDFGSGYQTCLAFRTFEPCKQLWCSHPDNXXFCKTKKGPPLDGTECAPGKWCFKGHC 



gi | 2 93 QCRFDFGSGYQTCLAFRTFEPCKQLWCSHPDNPYFCKTKKGPPLDGTECAPGKWCFKGHC 
490 500 510 520 530 540 

550 560 570 580 590 600 

Lex IWKSPEQTYGQDGGWSSWTKFGSCSRSCGGGVRSRSRSCNNPSPAYGGRXCLGPMFEYQV



gi | 2 93 IWKSPEQTYGQDGGWSSWTKFGSCSRSCGGGVRSRSRSCNNPSPAYGGRPCLGPMFEYQV 
550 560 570 580 590 600 

610 620 630 640 650 660 

Lex CNSEECPGTYEDFRAQQCAKRNXYYVHQNAKHXWVPYEPDDDAQKCELICQSADXGDWF 



gi | 2 93 CNSEECPGTYEDFRAQQCAKRNSYYVHQNAKHSWVPYEPDDDAQKCELICQSADTGDWF 
610 620 630 640 650 660 

670 680 690 700 710 720 

Lex MNQWHDGTRCSYRDPYSVCARGECVPVGCDKEVGSMKADDKCGVCGGDNSHCRTVKGTL 



gi | 2 93 MNQWHDGTRCSYRDPYSVCARGECVPVGCDKEVGSMKADDKCGVCGGDNSHCRTVKGTL 
670 680 690 700 710 720 

730 740 750 760 770 780 

Lex GKASKQAGALKLVQIPAGARHIQIEALEKSPHXXWKNQVTGSFILNPKGKEATSRTFTA 

gi | 2 93 GKASKQAGALKLVQIPAGARHIQIEALEKSPHRIWKNQVTGSFILNPKGKEATSRTFTA 
730 740 750 760 770 780 

790 800 810 820 830 840 

Lex MGLEWEDAVEDAKESLKTSGPLPEAIAILALPPTEGGPRSSLAYKYVIHEDLLPLIGSNN 



gi | 2 93 MGLEWEDAVEDAKESLKTSGPLPEAIAILALPPTEGGPRSSLAYKYVIHEDLLPLIGSNN 
790 800 810 820 830 840 

850 860 870 880 890 900 

Lex VLLEEMDTYEWALKSWAPCSKACGGGIQFTKYGCRRRRDHHMVQRHLCDHKKRPKPIRRR 

gi | 2 93 VLLEEMDTYEWALKSWAPCSKACGGGIQFTKYGCRRRRDHHMVQRHLCDHKKRPKPIRRR 
850 860 870 880 890 900 

910 920 930 940 950 960 

Lex CNQHPCSQPVWVTEEWGACSRSCGKLGVQTRGIQCLLPLSNGTHKVMPAKACAGDRPEAR 



gi | 2 93 CNQHPCSQPVWVTEEWGACSRSCGKLGVQTRGIQCLLPLSNGTHKVMPAKACAGDRPEAR 
910 920 930 940 950 960 



970 980 990 1000 1010 1020 

Lex RPCLRVPCPAQWRLGAWSQCSATCGEGIQQRQWCRTNANSLGHCEGDRPDTVQVCXLPA 
gi | 293 RPCLRVPCPAQWRLGAWSQCSATCGEGIQQRQWCRTNANSLGHCEGDRPDTVQVCSLPA
970 980 990 1000 1010 1020 

1030 1040 1050 1060 1070 1080 

Lex CGGNHQNSTVRADVWELGTPEGQWPQSXPLHPINKISSMCAAEPCTGDRSVFCQMEVLD 

gi | 293 CGGNHQNSTVRADVWELGTPEGQWVPQSEPLHPINKISS TEPCTGDRSVFCQMEVLD 

1030 1040 1050 1060 1070 

1090 1100 1110 1120 1130 1140

Lex RYCSIPGYHRLCCVSCIKKASGPNPGPDPGPTSLPPFSTPGSPLPGPQDPADAAEPPGKP 



gi | 293 RYCSIPGYHRLCCVSCIKKASGPNPGPDPGPTSLPPFSTPGSPLPGPQDPADAAEPPGKP 
1080 1090 1100 1110 1120 1130 

1150 1160 1170 1180 1190 1200 

Lex TGSEDHQHGRATQLPGALDTSSPGTQHPFAPETPIPGASWSISPTTPGGLPWGWTQTPTP 



gi | 293 TGSEDHQHGRATQLPGALDTSSPGTQHPFAPETPIPGASWSISPTTPGGLPWGWTQTPTP
1140 1150 1160 1170 1180 1190 

1210 1220 1230 1240 1250 

Lex VPEDKGQPGEDLRHPGTSLPADLPGRPPEPCHPTGTFTLCVLPRDSQLRGHT 



gi | 2 93 VPEDKGQPGEDLRHPGTSLPAASPVT 
1200 1210 1220 



1252 residues in 1 query sequences 
1223 residues in 1 library sequences 
Scomplib [version 3.3t05 March 30, 2000] 

start: Tue Jul 29 17:17:39 2003 done: Tue Jul 29 17:17:40 2003 
Scan time: 0.067 Display time: 2.117 

Function used was FASTA 




J Biol Chem. 2002 Feb 22;277(8):5756-66. Epub 2001 Dec 07.






Cloning and characterization of ADAMTS-14, a novel ADAMTS 
displaying high homology with ADAMTS-2 and ADAMTS-3. 

Colige A, Vandenberghe I, Thiry M, Lambert CA, Van Beeumen J, Li SW, 
Prockop DJ, Lapiere CM, Nusgens BV. 

Laboratory of Connective Tissues Biology, Experimental Cancerology Research 
Center, Tour de Pathologie (B23/3), University of Liege, B-4000 Liege, Belgium. 

The processing of amino- and carboxyl-propeptides of fibrillar collagens is 
required to generate collagen monomers that correctly assemble into fibrils. 
Mutations in the ADAMTS2 gene, the aminopropeptidase of procollagen I and II, 
result in the accumulation of non-fully processed type I procollagen, causing 
human Ehlers-Danlos syndrome type VIIC and animal dermatosparaxis. In this 
study, we show that the aminopropeptide of type I procollagen can be cleaved in 
vivo in absence of ADAMTS-2 activity and that this processing is performed at 
the cleavage site for ADAMTS-2. In an attempt to identify the enzyme 
responsible for this alternative aminoprocollagen peptidase activity, we have 
cloned the cDNA and determined the primary structure of human and mouse 
ADAMTS-14, a novel ADAMTS displaying striking homologies with ADAMTS-2
and -3. The structure of the human gene, which maps to 10q21.3, and the
mechanisms of generation of the various transcripts are described. The existence 
of two sites of initiation of transcription, in two different promoter contexts, 
suggests that transcripts resulting from these two sites can be differently 
regulated. The tissue distribution of ADAMTS-14, the regulation of the gene 
expression by various cytokines and the activity of the recombinant enzyme are 
evaluated. The potential function of ADAMTS-14 as a physiological 
aminoprocollagen peptidase in vivo is discussed. 



PMID: 11741898 [PubMed - indexed for MEDLINE] 



Gene. 2002 Jan 23;283(1-2):49-62.



Cloning, expression analysis, and structural characterization of 
seven novel human ADAMTSs, a family of metalloproteinases with 
disintegrin and thrombospondin-1 domains. 

Cal S, Obaya AJ, Llamazares M, Garabaya C, Quesada V, Lopez-Otin C. 

Departamento de Bioquimica y Biologia Molecular, Facultad de Medicina, 
Instituto Universitario de Oncologia, Universidad de Oviedo, 33006, Oviedo, 
Spain. 

ADAMTS (A Disintegrin And Metalloproteinase domain, with ThromboSpondin 
type-1 modules) is a recently described family of zinc-dependent proteases which 
play important roles in a variety of normal and pathological conditions, including 
arthritis and cancer. In this work, we report the identification and cloning of 
cDNAs encoding seven new human ADAMTSs. These novel enzymes have been 
called ADAMTS-13, -14, -15, -16, -17, -18, and -19. All of them show a domain 
organization similar to that of previously characterized family members, 
consisting of a signal sequence, a propeptide, a metalloproteinase domain, a 
disintegrin-like domain, a cysteine-rich region, and a variable number of TS-1 
repeats. Expression analysis revealed that these ADAMTS genes are mainly 
expressed in fetal tissues, especially in lung (ADAMTS 14, ADAMTS 16, 
ADAMTS 17, ADAMTS 18, and ADAMTS 19), kidney (ADAMTS 14, 
ADAMTS 15, and ADAMTS 16), and liver (ADAMTS 13, ADAMTS 15 and 
ADAMTS 18). Reverse transcriptase-polymerase chain reaction analysis also
revealed the expression of some of these new ADAMTSs in different human adult 
tissues, such as prostate (ADAMTS 13, ADAMTS 17, and ADAMTS 18), and 
brain (ADAMTS 13, ADAMTS 16, ADAMTS 17, and ADAMTS 18). High levels 
of ADAMTSs transcripts were also observed in some tumor biopsies and cell
lines, including osteosarcomas (ADAMTS 19), melanoma and colon carcinoma
cells (ADAMTS 13). Chromosomal location analysis indicated that the seven 
identified ADAMTS genes are dispersed in the human genome mapping to 9q34, 
10q21, 11q25, 5p15, 15q24, 16q23, and 5q31, respectively. According to these
results, together with a comparative analysis of ADAMTSs in other eukaryotic 
organisms, we conclude that these enzymes, with at least 18 distinct members 
encoded within the human genome, represent an example of a widely expanded 
protease family during metazoan evolution. 

PMID: 11867212 [PubMed - indexed for MEDLINE]



Characterization of ADAMTS14, a novel member of the ADAMTS 
metalloproteinase family. 

Bolz H, Ramirez A, von Brederlow B, Kubisch C. 

Institut für Humangenetik, Universitäts-Klinikum Eppendorf, Hamburg, Germany.
bolz@uke.uni-hamburg.de 

ADAMTS (a disintegrin-like and metalloproteinase domain with thrombospondin 
type 1 modules) proteins constitute a family of zinc metalloproteinases which 
target and process extracellular matrix proteins. We cloned and characterized a 
novel human ADAMTS gene, ADAMTS14, which is located on human
chromosome 10q2. ADAMTS14 exhibits the characteristic multidomain structure
of ADAMTS proteins including four thrombospondin modules and shows highest
similarity to ADAMTS3 and ADAMTS2. By RT-PCR analysis we demonstrated
that ADAMTS14 is expressed in human retina and also at low levels in adult
brain, lung and placenta. 



PMID: 11779638 [PubMed - indexed for MEDLINE] 




EXHIBIT "E" 



MegaBlast



MEGABLAST 1.2.3-Paracel [2001-11-20]
Reference:

Zheng Zhang, Scott Schwartz, Lukas Wagner, and Webb Miller (2000),
"A greedy algorithm for aligning DNA sequences",
J Comput Biol 2000; 7(1-2):203-14.
Database: Homo_sapiens.latestgp.masked.fa

33,840 sequences; 200,810,911,373 total letters 



Query= LEX221seqid19

(3759 letters) 

                                                           Score     E
Sequences producing significant alignments:                (bits)  Value

AL355344.20.1.149490                                          882   0.0
AC007484.2.50228.57677                                        854   0.0
AC069538.10.1.169772                                          654   0.0
AC007484.2.23991.29266                                        654   0.0
AC007484.2.57778.65289                                        387   e-104
AC007484.2.112555.124636                                      314   3e-82
AC010216.8.1.110753                                           139   1e-29



>AL355344.20.1.149490
Length = 149490

Score = 882 bits (444), Expect = 0.0
Identities = 444/444 (100%)
Strand = Plus / Plus



Query: 80     cagagctgcacctctctggaaagctcagtgactatggtgtgacagtgccctgcagcacag 139
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 147159 cagagctgcacctctctggaaagctcagtgactatggtgtgacagtgccctgcagcacag 147218

Query: 140    actttcggggacgcttcctctcccacgtggtgtctggcccagcagcagcctctgcaggga 199
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 147219 actttcggggacgcttcctctcccacgtggtgtctggcccagcagcagcctctgcaggga 147278

Query: 200    gcatggtagtggacacgccacccacactaccacgacactccagtcacctccgggtggctc 259
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 147279 gcatggtagtggacacgccacccacactaccacgacactccagtcacctccgggtggctc 147338

Query: 260    gcagccctctgcacccaggagggaccctgtggcctggcagggtggggcgccactccctct 319
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 147339 gcagccctctgcacccaggagggaccctgtggcctggcagggtggggcgccactccctct 147398

Query: 320    acttcaatgtcactgttttcgggaaggaactgcacttgcgcctgcggcccaatcggaggt 379
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 147399 acttcaatgtcactgttttcgggaaggaactgcacttgcgcctgcggcccaatcggaggt 147458

Query: 380    tggtagtgccaggatcctcagtggagtggcaggaggattttcgggagctgttccggcagc 439
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 147459 tggtagtgccaggatcctcagtggagtggcaggaggattttcgggagctgttccggcagc 147518

Query: 440    ccttacggcaggagtgtgtgtacactggaggtgtcactggaatgcctggggcagctgttg 499
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 147519 ccttacggcaggagtgtgtgtacactggaggtgtcactggaatgcctggggcagctgttg 147578

Query: 500    ccatcagcaactgtgacggattgg 523
              ||||||||||||||||||||||||
Sbjct: 147579 ccatcagcaactgtgacggattgg 147602



Score = 159 bits (80), Expect = 1e-35
Identities = 81/82 (98%)
Strand = Plus / Plus

Query: 1      atggctccactccgcgcgctgctgtcctacctgctgcctttgcactgtgcgctctgcrcc 60
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||| ||
Sbjct: 145409 atggctccactccgcgcgctgctgtcctacctgctgcctttgcactgtgcgctctgcgcc 145468

Query: 61     gccgcgggcagccggaccccag 82
              ||||||||||||||||||||||
Sbjct: 145469 gccgcgggcagccggaccccag 145490



>AC007484.2.50228.57677
Length = 7450 

Score = 854 bits (430), Expect = 0.0 
Identities = 441/444 (99%), Gaps = 3/444 (0%) 
Strand = Plus / Minus 

Query: 80   cagagctgcacctctctggaaagctcagtgactatggtgtgacagtgccctgcagcacag 139
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 5458 cagagctgcacctctctggaaagctcagtgactatggtgtgacagtgccctgcagcacag 5399

Query: 140  actttcggggacgcttcctctcccacgtggtgtctggcccagcagcagcctctgcaggga 199
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 5398 actttcggggacgcttcctctcccacgtggtgtctggcccagcagcagcctctgcaggga 5339

Query: 200  gcatggtagtggacacgccacccacactaccacgacactccagtcacctccgggtggctc 259
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 5338 gcatggtagtggacacgccacccacactaccacgacactccagtcacctccgggtggctc 5279

Query: 260  gcagccctctgcacccaggagggaccctgtggcctggcagggtggggcgccactccctct 319
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 5278 gcagccctctgcacccaggagggaccctgtggcctggcagggtggggcgccactccctct 5219

Query: 320  acttcaatgtcactgttttcgggaaggaactgcacttgcgcctgcggcccaatcggaggt 379
            ||||||||||||||||||||||||||||||||||||||||  ||||||||||||||||||
Sbjct: 5218 acttcaatgtcactgttttcgggaaggaactgcacttgcg--tgcggcccaatcggaggt 5161

Query: 380  tggtagtgccaggatcctcagtggagtggcaggaggattttcgggagctgttccggcagc 439
            |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 5160 tggtagtgccaggatcctcagtggagtggcaggaggattttcgggagctgttccggcag- 5102

Query: 440  ccttacggcaggagtgtgtgtacactggaggtgtcactggaatgcctggggcagctgttg 499
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 5101 ccttacggcaggagtgtgtgtacactggaggtgtcactggaatgcctggggcagctgttg 5042

Query: 500  ccatcagcaactgtgacggattgg 523
            ||||||||||||||||||||||||
Sbjct: 5041 ccatcagcaactgtgacggattgg 5018



Score = 133 bits (67), Expect = 7e-28 
Identities = 79/83 (95%), Gaps = 3/83 (3%) 
Strand = Plus / Minus 

Query: 1    atggctccactccgcgcgctgctgtcctacctgctgcctttgc-actgtgcgctctgcrc 59
            ||||||||||||||||||||||||||||||||||||||||||| ||||| |||||||| |
Sbjct: 7207 atggctccactccgcgcgctgctgtcctacctgctgcctttgcaactgt-cgctctgcgc 7149

Query: 60   cgccgcgggcagccggaccccag 82
            | |||||||||||||||||||||
Sbjct: 7148 c-ccgcgggcagccggaccccag 7127



>AC069538.10.1.169772

Length = 169772 

Score = 654 bits (329), Expect = 0.0 
Identities = 336/338 (99%), Gaps = 1/338 (0%) 
Strand = Plus / Minus 

Query: 3337   tcactgccccccttctccactcctggaagccccttaccaggaccccaggaccctgcagat 3396
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 132825 tcactgccccccttctccactcctggaagccccttaccaggaccccaggaccctgcagat 132766

Query: 3397   gctgcagagcctcctggaaagccaacgggatcagaggaccatcagcatggccgagccaca 3456
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 132765 gctgcagagcctcctggaaagccaacgggatcagaggaccatcagcatggccgagccaca 132706

Query: 3457   cagctcccaggagctctggatacaagctccccagggacccagcatccctttgcccctgag 3516
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 132705 cagctcccaggagctctggatacaagctccccagggacccagcatccctttgcccctgag 132646

Query: 3517   acaccaatccctggagcatcctggagcatctcccctaccacccccggggggctgccttgg 3576
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 132645 acaccaatccctggagcatcctggagcatctcccctaccacccccggggggctgccttgg 132586

Query: 3577   ggctggactcagacacctacgccagtccctgaggacaaagggcaacctggagaagacctg 3636
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 132585 ggctggactcagacacctacgccagtccctgaggacaaagggcaacctggagaagacctg 132526

Query: 3637   agacatcccggcaccagcctccctgctgacctgcccgg 3674
              |||||||||||||||||||||||||||| ||| |||||
Sbjct: 132525 agacatcccggcaccagcctccctgctg-cctccccgg 132489



Score = 405 bits (204), Expect = e-110 
Identities = 207/208 (99%) 
Strand = Plus / Minus 

Query: 2730   gtgggtgacggaggagtggggtgcctgcagccggagctgtgggaagctgggggtgcagac 2789
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 139531 gtgggtgacggaggagtggggtgcctgcagccggagctgtgggaagctgggggtgcagac 139472

Query: 2790   acgggggatacagtgcctgctgcccctctccaatggaacccacaaggtcatgccggccaa 2849
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 139471 acgggggatacagtgcctgctgcccctctccaatggaacccacaaggtcatgccggccaa 139412

Query: 2850   agcctgcgccggggaccggcctgaggcccgacggccctgtctccgagtgccctgcccagc 2909
              |||||| |||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 139411 agcctgtgccggggaccggcctgaggcccgacggccctgtctccgagtgccctgcccagc 139352

Query: 2910   ccagtggaggctgggagcctggtcccag 2937
              ||||||||||||||||||||||||||||
Sbjct: 139351 ccagtggaggctgggagcctggtcccag 139324



Score = 340 bits (171), Expect = 5e-90 
Identities = 171/171 (100%) 
Strand = Plus / Minus 

Query: 2427   ggctctccccccaactgagggtggcccccgcagcagcctggcctacaagtacgtcatcca 2486
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 141854 ggctctccccccaactgagggtggcccccgcagcagcctggcctacaagtacgtcatcca 141795

Query: 2487   tgaggacctgctgccccttatcgggagcaacaatgtgctcctggaggagatggacaccta 2546
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 141794 tgaggacctgctgccccttatcgggagcaacaatgtgctcctggaggagatggacaccta 141735

Query: 2547   tgagtgggcgctcaagagctgggccccctgcagcaaggcctgtggaggagg 2597
              |||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 141734 tgagtgggcgctcaagagctgggccccctgcagcaaggcctgtggaggagg 141684



Score = 338 bits (170), Expect = 2e-89 
Identities = 173/176 (98%)
Strand = Plus / Minus

Query: 1749   cccagcctatggaggccgccygtgcttagggcccatgttcgagtaccaggtctgcaacag 1808
              |||||||||||||||||||| |||||||||||||||||||||||||||||||||||||||
Sbjct: 152344 cccagcctatggaggccgcctgtgcttagggcccatgttcgagtaccaggtctgcaacag 152285

Query: 1809   cgaggagtgccctgggacctacgaggacttccgggcccagcagtgtgccaagcgcaactc 1868
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 152284 cgaggagtgccctgggacctacgaggacttccgggcccagcagtgtgccaagcgcaactc 152225

Query: 1869   stactatgtgcaccagaatgccaagcacagstgggtgccctacgagcctgacgatg 1924
               ||||||||||||||||||||||||||||| |||||||||||||||||||||||||
Sbjct: 152224 ctactatgtgcaccagaatgccaagcacagctgggtgccctacgagcctgacgatg 152169



Score = 330 bits (166), Expect = 4e-87 
Identities = 166/166 (100%) 
Strand = Plus / Minus 

Query: 2263   gtggtgaagaaccaggtcaccggcagcttcatcctcaaccccaagggcaaggaagccaca 2322
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 143510 gtggtgaagaaccaggtcaccggcagcttcatcctcaaccccaagggcaaggaagccaca 143451

Query: 2323   agccggaccttcaccgccatgggcctggagtgggaggatgcggtggaggatgccaaggaa 2382
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 143450 agccggaccttcaccgccatgggcctggagtgggaggatgcggtggaggatgccaaggaa 143391

Query: 2383   agcctcaagaccagcgggcccctgcctgaagccattgccatcctgg 2428
              ||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 143390 agcctcaagaccagcgggcccctgcctgaagccattgccatcctgg 143345



Score = 306 bits (154), Expect = 7e-80 
Identities = 158/159 (99%), Gaps = 1/159 (0%) 
Strand = Plus / Minus 



Query: 952    cagtccctgagcctgatcgagcgcgggaacccctcacgcagcctggagcaggtgtgtcgc 1011
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 163232 cagtccctgagcctgatcgagcgcgggaacccctcacgcagcctggagcaggtgtgtcgc 163173

Query: 1012   tgggcacactcccagcagcgccaggaccccagccacgctgagcaccatgaccacgttgtg 1071
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 163172 tgggcacactcccagcagcgccaggaccccagccacgctgagcaccatgaccacgttgtg 163113

Query: 1072   ttcctcacccggcaggactttgggccctcagggtatgca 1110
              |||||||||||||||||||||||||||||||| ||||||
Sbjct: 163112 ttcctcacccggcaggactttgggccctcagg-tatgca 163075



Score = 300 bits (151), Expect = 4e-78 
Identities = 151/151 (100%) 
Strand = Plus / Minus 

Query: 1598   agtggtgcttcaaaggtcactgcatctggaagtcgccggagcagacatatggccaggatg 1657
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 154491 agtggtgcttcaaaggtcactgcatctggaagtcgccggagcagacatatggccaggatg 154432

Query: 1658   gaggctggagctcctggaccaagtttgggtcatgttcgcggtcatgtgggggcggggtgc 1717
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 154431 gaggctggagctcctggaccaagtttgggtcatgttcgcggtcatgtgggggcggggtgc 154372

Query: 1718   gatcccgcagccggagctgcaacaacccctc 1748
              |||||||||||||||||||||||||||||||
Sbjct: 154371 gatcccgcagccggagctgcaacaacccctc 154341



Score = 286 bits (144), Expect = 6e-74 
Identities = 144/144 (100%) 
Strand = Plus / Minus 

Query: 1209   gctcggcatggagcatgacggtcaggggaatggctgtgcagatgagaccagcctgggcag 1268
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 159446 gctcggcatggagcatgacggtcaggggaatggctgtgcagatgagaccagcctgggcag 159387

Query: 1269   cgtcatggcgcccctggtgcaggctgccttccaccgcttccattggtcccgctgcagcaa 1328
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 159386 cgtcatggcgcccctggtgcaggctgccttccaccgcttccattggtcccgctgcagcaa 159327

Query: 1329   gctggagctcagccgctacctccc 1352
              ||||||||||||||||||||||||
Sbjct: 159326 gctggagctcagccgctacctccc 159303



Score = 274 bits (138), Expect = 2e-70 
Identities = 138/138 (100%) 
Strand = Plus / Minus 



Query: 2595   agggatccagttcaccaaatacggctgccggcgcagacgagaccaccacatggtgcagcg 2654
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 141238 agggatccagttcaccaaatacggctgccggcgcagacgagaccaccacatggtgcagcg 141179

Query: 2655   acacctgtgtgaccacaagaagaggcccaagcccatccgccggcgctgcaaccagcaccc 2714
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 141178 acacctgtgtgaccacaagaagaggcccaagcccatccgccggcgctgcaaccagcaccc 141119

Query: 2715   gtgctctcagcctgtgtg 2732
              ||||||||||||||||||
Sbjct: 141118 gtgctctcagcctgtgtg 141101



Score = 264 bits (133), Expect = 2e-67 
Identities = 134/135 (99%) 
Strand = Plus / Minus 

Query: 1924   gacgcccagaagtgtgagctgatctgccagtcggcggacacrggggacgtggtgttcatg 1983
              ||||||||||||||||||||||||||||||||||||||||| ||||||||||||||||||
Sbjct: 149784 gacgcccagaagtgtgagctgatctgccagtcggcggacacgggggacgtggtgttcatg 149725

Query: 1984   aaccaggtggttcacgatgggacacgctgcagctaccgggacccatacagcgtctgtgcg 2043
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 149724 aaccaggtggttcacgatgggacacgctgcagctaccgggacccatacagcgtctgtgcg 149665

Query: 2044   cgtggcgagtgtgtg 2058
              |||||||||||||||
Sbjct: 149664 cgtggcgagtgtgtg 149650



Score = 264 bits (133), Expect = 2e-67 
Identities = 133/133 (100%) 
Strand = Plus / Minus 

Query: 1353   ctcctacgactgcctcctcgatgacccctttgatcctgcctggccccagcccccagagct 1412
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 158162 ctcctacgactgcctcctcgatgacccctttgatcctgcctggccccagcccccagagct 158103

Query: 1413   gcctgggatcaactactcaatggatgagcagtgccgctttgactttggcagtggctacca 1472
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 158102 gcctgggatcaactactcaatggatgagcagtgccgctttgactttggcagtggctacca 158043

Query: 1473   gacctgcttggca 1485
              |||||||||||||
Sbjct: 158042 gacctgcttggca 158030



Score = 262 bits (132), Expect = 1e-66
Identities = 133/134 (99%)
Strand = Plus / Minus

Query: 2935   cagtgctctgccacctgtggagagggcatccagcagcggcaggtggtgtgcaggaccaac 2994
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 135372 cagtgctctgccacctgtggagagggcatccagcagcggcaggtggtgtgcaggaccaac 135313

Query: 2995   gccaacagcctcgggcattgcgagggggataggccagacactgtccaggtctgcarcctg 3054
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||| ||||
Sbjct: 135312 gccaacagcctcgggcattgcgagggggataggccagacactgtccaggtctgcagcctg 135253

Query: 3055   cccgcctgtggagg 3068
              ||||||||||||||
Sbjct: 135252 cccgcctgtggagg 135239



Score = 254 bits (128), Expect = 2e-64 
Identities = 128/128 (100%) 
Strand = Plus / Minus 

Query: 3179   tgtgtgcagcggagccctgcacgggagacaggtctgtcttctgccagatggaagtgctcg 3238
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 132983 tgtgtgcagcggagccctgcacgggagacaggtctgtcttctgccagatggaagtgctcg 132924

Query: 3239   atcgctactgctccattcccggctaccaccggctctgctgtgtgtcctgcatcaagaagg 3298
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 132923 atcgctactgctccattcccggctaccaccggctctgctgtgtgtcctgcatcaagaagg 132864

Query: 3299   cctcgggc 3306
              ||||||||
Sbjct: 132863 cctcgggc 132856



Score = 250 bits (126), Expect = 4e-63 
Identities = 126/126 (100%) 
Strand = Plus / Minus 

Query: 2058   gcctgtcggctgtgacaaggaggtggggtccatgaaggcggatgacaagtgtggagtctg 2117
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 149266 gcctgtcggctgtgacaaggaggtggggtccatgaaggcggatgacaagtgtggagtctg 149207

Query: 2118   cgggggtgacaactcccactgcaggactgtgaaggggacgctgggcaaggcctccaagca 2177
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 149206 cgggggtgacaactcccactgcaggactgtgaaggggacgctgggcaaggcctccaagca 149147

Query: 2178   ggcagg 2183
              ||||||
Sbjct: 149146 ggcagg 149141



Score = 221 bits (111), Expect = 4e-54 
Identities = 112/113 (99%) 
Strand = Plus / Minus 

Query: 3066   aggaaatcaccagaactccacggtgagggccgatgtctgggaacttgggacgccagaggg 3125
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 135158 aggaaatcaccagaactccacggtgagggccgatgtctgggaacttgggacgccagaggg 135099

Query: 3126   gcagtgggtgccacaatctgracccctacatcccattaacaagatatcatcaa 3178
              |||||||||||||||||||| ||||||||||||||||||||||||||||||||
Sbjct: 135098 gcagtgggtgccacaatctgaacccctacatcccattaacaagatatcatcaa 135046



Score = 219 bits (110), Expect = 1e-53
Identities = 112/114 (98%)
Strand = Plus / Minus

Query: 1486   ttcaggacctttgagccctgcaagcagctgtggtgcagccatcctgacaaccmgtayttc 1545
              |||||||||||||||||||||||||||||||||||||||||||||||||||| ||| |||
Sbjct: 156651 ttcaggacctttgagccctgcaagcagctgtggtgcagccatcctgacaacccgtacttc 156592

Query: 1546   tgcaagaccaagaaggggcccccgctggatgggactgagtgtgcacccggcaag 1599
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 156591 tgcaagaccaagaaggggcccccgctggatgggactgagtgtgcacccggcaag 156538



Score = 219 bits (110), Expect = 1e-53
Identities = 110/110 (100%)
Strand = Plus / Minus

Query: 1100   cagggtatgcacccgtcactggcatgtgtcaccccctgaggagctgtgccctcaaccatg 1159
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 161080 cagggtatgcacccgtcactggcatgtgtcaccccctgaggagctgtgccctcaaccatg 161021

Query: 1160   aggatggcttctcctcagccttcgtgatagctcatgagaccggccacgtg 1209
              ||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 161020 aggatggcttctcctcagccttcgtgatagctcatgagaccggccacgtg 160971



Score = 167 bits (84), Expect = 5e-38 
Identities = 84/84 (100%) 
Strand = Plus / Minus 

Query: 871    gtagatgagatttaccacgatgagtccctgggggttcatataaatattgccctcgtccgc 930
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 164037 gtagatgagatttaccacgatgagtccctgggggttcatataaatattgccctcgtccgc 163978

Query: 931    ttgatcatggttggctaccgacag 954
              ||||||||||||||||||||||||
Sbjct: 163977 ttgatcatggttggctaccgacag 163954



Score = 161 bits (81), Expect = 3e-36 
Identities = 81/81 (100%) 
Strand = Plus / Minus 

Query: 2178   ggcaggagctctcaagctggtgcagatcccagcaggtgccaggcacatccagattgaggc 2237
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 148110 ggcaggagctctcaagctggtgcagatcccagcaggtgccaggcacatccagattgaggc 148051

Query: 2238   actggagaagtccccccaccg 2258
              |||||||||||||||||||||
Sbjct: 148050 actggagaagtccccccaccg 148030



Score = 145 bits (73), Expect = 2e-31 
Identities = 73/73 (100%) 
Strand = Plus / Minus 

Query: 3687   gccctgccatcccactggcacgtttacactctgtgtactgccccgtgactcccagctcag 3746
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 132475 gccctgccatcccactggcacgtttacactctgtgtactgccccgtgactcccagctcag 132416

Query: 3747   aggacacacatag 3759
              |||||||||||||
Sbjct: 132415 aggacacacatag 132403



Score = 89.9 bits (45), Expect = 1e-14
Identities = 66/73 (90%)
Strand = Plus / Minus

Query: 1375   gacccctttgatcctgcctggccccagcccccagagctgcctgggatcaactactcaatg 1434
              ||||||||||| ||  ||||||||||||||||||||||||| |||||| ||| |||||||
Sbjct: 158909 gacccctttgagcccacctggccccagcccccagagctgcccgggatcgacttctcaatg 158850

Query: 1435   gatgagcagtgcc 1447
              ||||| |||||||
Sbjct: 158849 gatgaacagtgcc 158837



>AC007484.2.23991.29266
Length = 5276 

Score = 654 bits (329), Expect = 0.0 
Identities = 336/338 (99%), Gaps = 1/338 (0%) 
Strand = Plus / Minus 

Query: 3337 tcactgccccccttctccactcctggaagccccttaccaggaccccaggaccctgcagat 3396
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 2310 tcactgccccccttctccactcctggaagccccttaccaggaccccaggaccctgcagat 2251

Query: 3397 gctgcagagcctcctggaaagccaacgggatcagaggaccatcagcatggccgagccaca 3456
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 2250 gctgcagagcctcctggaaagccaacgggatcagaggaccatcagcatggccgagccaca 2191

Query: 3457 cagctcccaggagctctggatacaagctccccagggacccagcatccctttgcccctgag 3516
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 2190 cagctcccaggagctctggatacaagctccccagggacccagcatccctttgcccctgag 2131

Query: 3517 acaccaatccctggagcatcctggagcatctcccctaccacccccggggggctgccttgg 3576
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 2130 acaccaatccctggagcatcctggagcatctcccctaccacccccggggggctgccttgg 2071

Query: 3577 ggctggactcagacacctacgccagtccctgaggacaaagggcaacctggagaagacctg 3636
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 2070 ggctggactcagacacctacgccagtccctgaggacaaagggcaacctggagaagacctg 2011

Query: 3637 agacatcccggcaccagcctccctgctgacctgcccgg 3674
            |||||||||||||||||||||||||||| ||| |||||
Sbjct: 2010 agacatcccggcaccagcctccctgctg-cctccccgg 1974



Score = 262 bits (132), Expect = 1e-66
Identities = 133/134 (99%)
Strand = Plus / Minus

Query: 2935 cagtgctctgccacctgtggagagggcatccagcagcggcaggtggtgtgcaggaccaac 2994
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 4860 cagtgctctgccacctgtggagagggcatccagcagcggcaggtggtgtgcaggaccaac 4801

Query: 2995 gccaacagcctcgggcattgcgagggggataggccagacactgtccaggtctgcarcctg 3054
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||| ||||
Sbjct: 4800 gccaacagcctcgggcattgcgagggggataggccagacactgtccaggtctgcagcctg 4741

Query: 3055 cccgcctgtggagg 3068
            ||||||||||||||
Sbjct: 4740 cccgcctgtggagg 4727



Score = 254 bits (128), Expect = 2e-64 
Identities = 128/128 (100%) 
Strand = Plus / Minus 

Query: 3179 tgtgtgcagcggagccctgcacgggagacaggtctgtcttctgccagatggaagtgctcg 3238
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 2468 tgtgtgcagcggagccctgcacgggagacaggtctgtcttctgccagatggaagtgctcg 2409

Query: 3239 atcgctactgctccattcccggctaccaccggctctgctgtgtgtcctgcatcaagaagg 3298
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 2408 atcgctactgctccattcccggctaccaccggctctgctgtgtgtcctgcatcaagaagg 2349

Query: 3299 cctcgggc 3306
            ||||||||
Sbjct: 2348 cctcgggc 2341



Score = 221 bits (111), Expect = 4e-54 
Identities = 112/113 (99%) 
Strand = Plus / Minus 

Query: 3066 aggaaatcaccagaactccacggtgagggccgatgtctgggaacttgggacgccagaggg 3125
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 4646 aggaaatcaccagaactccacggtgagggccgatgtctgggaacttgggacgccagaggg 4587

Query: 3126 gcagtgggtgccacaatctgracccctacatcccattaacaagatatcatcaa 3178
            |||||||||||||||||||| ||||||||||||||||||||||||||||||||
Sbjct: 4586 gcagtgggtgccacaatctgaacccctacatcccattaacaagatatcatcaa 4534



Score = 145 bits (73), Expect = 2e-31 
Identities = 73/73 (100%) 
Strand = Plus / Minus 



Query: 3687 gccctgccatcccactggcacgtttacactctgtgtactgccccgtgactcccagctcag 3746
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 1960 gccctgccatcccactggcacgtttacactctgtgtactgccccgtgactcccagctcag 1901

Query: 3747 aggacacacatag 3759
            |||||||||||||
Sbjct: 1900 aggacacacatag 1888



>AC007484.2.57778.65289
Length = 7512 

Score = 387 bits (195), Expect = e-104 
Identities = 195/195 (100%) 
Strand = Plus / Minus 

Query: 678  agcctttggcctgggagaccttcccaacctgctgggcctggtgggggaccagctgggcga 737
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 2978 agcctttggcctgggagaccttcccaacctgctgggcctggtgggggaccagctgggcga 2919

Query: 738  cacagagcggaagcggcggcatgccaagccaggcagctacagcatcgaggtgctgctggt 797
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 2918 cacagagcggaagcggcggcatgccaagccaggcagctacagcatcgaggtgctgctggt 2859

Query: 798  ggtggacgactcggtggttcgcttccatggcaaggagcatgtgcagaactatgtcctcac 857
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 2858 ggtggacgactcggtggttcgcttccatggcaaggagcatgtgcagaactatgtcctcac 2799

Query: 858  cctcatgaatatcgt 872
            |||||||||||||||
Sbjct: 2798 cctcatgaatatcgt 2784



>AC007484.2.112555.124636
Length = 12082 

Score = 314 bits (158), Expect = 3e-82 
Identities = 158/158 (100%) 
Strand = Plus / Plus 

Query: 522   ggcgggcctcatccgcacagacagcaccgacttcttcattgagcctctggagcggggcca 581
             ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 10474 ggcgggcctcatccgcacagacagcaccgacttcttcattgagcctctggagcggggcca 10533

Query: 582   gcaggagaaggaggccagcgggaggacacatgtggtgtaccgccgggaggccgtccagca 641
             ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct: 10534 gcaggagaaggaggccagcgggaggacacatgtggtgtaccgccgggaggccgtccagca 10593

Query: 642   ggagtgggcagaacctgacggggacctgcacaatgaag 679
             ||||||||||||||||||||||||||||||||||||||
Sbjct: 10594 ggagtgggcagaacctgacggggacctgcacaatgaag 10631



>AC010216.8.1.110753

Length = 110753 

Score = 139 bits (70), Expect = 1e-29
Identities = 101/112 (90%), Gaps = 4/112 (3%) 
Strand = Plus / Plus 



Query: 1490  ggacctttgagccctgcaagcagctgtggtgcagccatcctgacaaccmgtayttctgca 1549
             |||||||||| |||||||||||||||||||||||||||||||||||||  || || ||||
Sbjct: 32976 ggacctttgacccctgcaagcagctgtggtgcagccatcctgacaacccctacttttgca 33035

Query: 1550  agaccaagaaggggcccccgct-ggatgggactga-gtgtgcacccggcaag 1599
             ||||||||||||||||||| || ||| |||||| | ||||||||| ||||||
Sbjct: 33036 agaccaagaaggggccccc-cttggacgggact-atgtgtgcacctggcaag 33085



Score = 129 bits (65), Expect = 1e-26
Identities = 83/89 (93%) 
Strand = Plus / Plus 

Query: 1261  ctgggcagcgtcatggcgcccctggtgcaggctgccttccaccgcttccattggtcccgc 1320
             ||||||||| |||||||||||||||||||||| ||||||||||||||||| |||||||||
Sbjct: 31087 ctgggcagcatcatggcgcccctggtgcaggccgccttccaccgcttccactggtcccgc 31146

Query: 1321  tgcagcaagctggagctcagccgctacct 1349
             |||||| ||| |||||| |||||||||||
Sbjct: 31147 tgcagccagcaggagctgagccgctacct 31175



